The Search for Gold Nuggets Using CRISP-DM Without a Seasoned Miner

نویسنده

  • P. W. Beers
چکیده

The rise of data mining has brought many changes to people’s lives but also to companies and the importance of data analysis. Companies always had a tendency to gather as much data as possible but it has only been recently due to the developments in IT that large quantities of data can be analyzed in a fast and easy way. This new field gave rise to the methodology of Cross Industry Standard Process for Data mining (CRISP-DM) and this method is the current standard of data mining. This process has been widely applied but has not been updated since its release in 1999. There have been many suggestions for improvements to the technique by researchers such as Clifton and Thuraisingham [3] as well as Zapata and Gil [13] and many others. This study will look into what improvements can be made to CRISP-DM. The improvements recommended by this study are based on a literature study as well as a field study at a company to observe a data mining process. There were many suggestions found in the literature to improve CRISP-DM and the field study showed that there is not always a project leader but a departmental structure present at a company making it harder to implement a methodology such as CRISP-DM. This paper has made six suggestions for improvements to CRISPDM which can result in new versions of CRISP-DM or even new data mining techniques.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ontology-guided intelligent data mining assistance: Combining declarative and procedural knowledge

The effective application of a data mining process is littered with many difficult and technical decisions (i.e. data cleansing, feature transformations, algorithms, parameters, evaluation). Subsequently, most data mining products provide a large number of models and tools, but few provide intelligent assistance for addressing the above-mentioned challenges that face the non-specialist data min...

متن کامل

FUZZY GRAVITATIONAL SEARCH ALGORITHM AN APPROACH FOR DATA MINING

The concept of intelligently controlling the search process of gravitational search algorithm (GSA) is introduced to develop a novel data mining technique. The proposed method is called fuzzy GSA miner (FGSA-miner). At first a fuzzy controller is designed for adaptively controlling the gravitational coefficient and the number of effective objects, as two important parameters which play major ro...

متن کامل

APPLICATION OF TABU SEARCH FOR SOLVING THE BI-OBJECTIVE WAREHOUSE PROBLEM IN A FUZZY ENVIRONMENT

The bi-objective warehouse problem in a crisp environment is often not eective in dealing with the imprecision or vagueness in the values of the problem parameters. To deal with such situations, several researchers have proposed that the parameters be represented as fuzzy numbers. We describe a new algorithm for fuzzy bi-objective warehouse problem using a ranking function followed by an applic...

متن کامل

Retaining Customers Using Clustering and Association Rules in Insurance Industry: A Case Study

This study clusters customers and finds the characteristics of different groups in a life insurance company in order to find a way for prediction of customer behavior based on payment. The approach is to use clustering and association rules based on CRISP-DM methodology in data mining. The researcher could classify customers of each policy in three different clusters, using association rules. A...

متن کامل

Implementation of CRISP Methodology for ERP Systems

Abstract— ERP systems contain huge amounts of data related to the actual execution of business processes. These systems have a particular way of recording activities which results in an unclear display of business processes in event logs. Several works have been conducted on ERP systems, most of them focusing on the development of new algorithms for the automatic discovery of business processes...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016